NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Logistic-Beta Processes for Dependent Random Probabilities with Beta Marginals

https://doi.org/10.1214/25-BA1541

Lee, Changwoo J; Zito, Alessandro; Sang, Huiyan; Dunson, David B (December 2025, Bayesian Analysis)

Free, publicly-accessible full text available December 1, 2026
Inferring Covariance Structure from Multiple Data Sources via Subspace Factor Analysis

https://doi.org/10.1080/01621459.2024.2408777

Chandra, Noirrit Kiran; Dunson, David B; Xu, Jason (April 2025, Journal of the American Statistical Association)

Free, publicly-accessible full text available April 3, 2026
Exact sampling of spanning trees via fast-forwarded random walks

https://doi.org/10.1093/biomet/asaf031

Tam, Edric; Dunson, David B; Duan, Leo L (January 2025, Biometrika)

Summary Tree graphs are used routinely in statistics. When estimating a Bayesian model with a tree component, sampling the posterior remains a core difficulty. Existing Markov chain Monte Carlo methods tend to rely on local moves, often leading to poor mixing. A promising approach is to instead directly sample spanning trees on an auxiliary graph. Current spanning tree samplers, such as the celebrated Aldous–Broder algorithm, rely predominantly on simulating random walks that are required to visit all the nodes of the graph. Such algorithms are prone to getting stuck in certain subgraphs. We formalize this phenomenon using the bottlenecks in the random walk’s transition probability matrix. We then propose a novel fast-forwarded cover algorithm that can break free from bottlenecks. The core idea is a marginalization argument that leads to a closed-form expression that allows for fast-forwarding to the event of visiting a new node. Unlike many existing approximation algorithms, our algorithm yields exact samples. We demonstrate the enhanced efficiency of the fast-forwarded cover algorithm, and illustrate its application in fitting a Bayesian dendrogram model on a Massachusetts crime and community dataset.
more » « less
Full Text Available
Bayesian Pyramids: identifiable multilayer discrete latent structure models for discrete data

https://doi.org/10.1093/jrsssb/qkad010

Gu, Yuqi; Dunson, David B (March 2023, Journal of the Royal Statistical Society Series B: Statistical Methodology)

Abstract High-dimensional categorical data are routinely collected in biomedical and social sciences. It is of great importance to build interpretable parsimonious models that perform dimension reduction and uncover meaningful latent structures from such discrete data. Identifiability is a fundamental requirement for valid modeling and inference in such scenarios, yet is challenging to address when there are complex latent structures. In this article, we propose a class of identifiable multilayer (potentially deep) discrete latent structure models for discrete data, termed Bayesian Pyramids. We establish the identifiability of Bayesian Pyramids by developing novel transparent conditions on the pyramid-shaped deep latent directed graph. The proposed identifiability conditions can ensure Bayesian posterior consistency under suitable priors. As an illustration, we consider the two-latent-layer model and propose a Bayesian shrinkage estimation approach. Simulation results for this model corroborate the identifiability and estimatability of model parameters. Applications of the methodology to DNA nucleotide sequence data uncover useful discrete latent features that are highly predictive of sequence types. The proposed framework provides a recipe for interpretable unsupervised learning of discrete data and can be a useful alternative to popular machine learning methods.
more » « less
Full Text Available
Dimension-Grouped Mixed Membership Models for Multivariate Categorical Data

Gu, Yuqi; Erosheva, Elena E.; Xu, Gongjun; Dunson, David B. (April 2023, Journal of machine learning research)

Full Text Available
Latent Nested Nonparametric Priors (with Discussion)

https://doi.org/10.1214/19-BA1169

Camerlenghi, Federico; Dunson, David B.; Lijoi, Antonio; Prünster, Igor; Rodríguez, Abel (December 2019, Bayesian Analysis)
null (Ed.)
Full Text Available
Extrinsic Local Regression on Manifold-Valued Data

https://doi.org/10.1080/01621459.2016.1208615

Lin, Lizhen; St. Thomas, Brian; Zhu, Hongtu; Dunson, David B. (July 2016, Journal of the American Statistical Association)

Full Text Available

Search for: All records